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METHOD AND APPARATUS FOR 
CACHE PROMOTION 



10 PRIORITY CLAIM 

This application claims the benefit of United States Patent Application Serial No. 
10/072,238, filed 05 February 2002 

BACKGROUND 

1 5 Field of the Disclosure 

The present disclosure relates to data communications, and, in particular, to a novel 
system and apparatus for cache promotion. 

The Prior Art 

20 Background 

Once an audio or video work has been recorded it may be both downloaded by users 
for play, or broadcast ("streamed") over the Internet or conventional radio or television 
broadcast or satellite broadcast media. When works are streamed, they may be listened to or 
viewed by Internet users in a manner much like traditional radio and television stations. 

25 The catalog of pre-recorded works that may be streamed over a media channel is 

immense; a broadcaster may choose among millions of pre-recorded works. In managing a 
media stream, broadcasters may differentiate a media stream from competitors by 
constructing a repertoire of works that are performed over a particular stream. For example, 
a broadcaster may limit the works performed over a particular media stream to a specific 

30 genre of music or television show or still image. Today many broadcasters limit their 
repertoire of works performed over a given media stream to several hundred. They may add 
and subtract only a few works from the repertoire every month. 

Prior to the Internet, few media streams were available to reach consumers. For 
example, a radio receiver in a metropolitan area might be within broadcast range of 50 FM 

35 radio stations. When driving in the country, there might be only a handful of FM radio 
stations available to the listener. Furthermore, even fewer television streams were available 
to the consumer. Because of the inherent distance limitations on legacy media streams, a 
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5 media stream in one geographic area would often utilize the same or very similar repertoire 
as a station in another geographic area. By way of example, every metropolitan area may 
have a "country music 59 station that performs essentially the same repertoire of music as 
country music station in other areas. Under these conditions, the actual number of unique 
works in the repertoire of all media in all geographies was relatively small 
10 New technologies have dramatically increased the number of media channels 

available for streaming content. Advances in the art such as local wireless networks, cost 
effective broadband connections, direct satellite broadcast, high quality compression 
techniques and the like are making the number of media channels available to reach 
consumers explode. In the case of new technologies that are geographically neutral, such as 
15 digital media streaming over the internet and direct satellite broadcast, a given media stream 
has a much larger potential audience. In an effort to differentiate media streams and capture 
specific consumers, broadcasters are creating new repertoires for their media streams. As an 
example only, a broadcaster may now find it economically viable to offer a media stream of 
the music of a certain very few performers. One could imagine an audio stream of only 
20 music written or performed by Frank Sinatra. 

This plethora of new repertoires promises to dramatically increase the actual number 
of unique works performed over all media channels. 

Media streams often contain both performances of pre-recorded work and 
extemporaneous work, such announcements or other narrative material Furthermore, media 
25 streams may contain no information about the work being performed, or the information 
provided may be imprecise. 

Given the widespread use of streamed media, audio works, or video works may need 
to be identified. The need for identification of works may arise in a variety of situations. 
For example, an artist may wish to verify royalty payments or generate their own Arbitron®- 
30 like ratings by identifying how often their works are being performed. Thus, playlists of 
media may need to be generated. Additionally, for competitive analysis a business may wish 
to know when and where a competitor is placing advertising in the media. Furthermore, a 
broadcast source may want to know when and how often a competitive broadcast source is 
using pre-recorded material A comprehensive playlist would accurately account for all pre- 
35 recorded works that are streamed over the media. 

Further complicating the identification are improvements in technology allowing a 
tremendous number of new works to be produced, such as new song recordings, new 
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5 advertisements, news worthy audio clips, and station promotions. A comprehensive playlist 
preferably would include these new works, which may be performed over a wide variety of 
media streams. 

Figure 1 shows a playlist generation system 100 of the prior art. The system 100 
may include one or more remote detection modules 102 deployed at various locations 
10 throughout a broadcast area. Each detection module 102 may include an antenna 104 for 
receiving broadcast signals and providing the signals to an analysis and lookup module 106. 
The module 106 is typically configured to identify the content of the received signal by 
comparing its audio content against a database 108 of reference representations of known 
works. 

15 If a match is made, typically the module 102 will keep a record of all matches made 

during a predetermined period of time. For example, the module 102 may keep a record of 
song titles detected during a 24-hour period. 

The system 100 may further include a playlist server 110 having a processor 112 and 
database 1 14. The server 1 10 is typically configured to receive information such as the titles 

20 of identified songs from the one or more detection modules 102 through a network such as 
the Internet 109 and generate a playlist which may be stored on database 114. 

However, the system 100 is typically unable to identify works for which a 
corresponding reference representation does not exist in the reference database. In addition, 
the system 100 must attempt to identify the entire media stream against the entire reference 

25 database. 

As noted above, the entire reference database of all available works may be very 
large. Typical systems today may restrict the number of samples in the reference database 
108 to a certain maximum number that will yield acceptable response time. In some 
implementations, samples of newly released works are included in the active database and 
30 then one of many conventional cache demotion strategies may be used to keep the size of the 
active database below the certain maximum number. For example, Least Frequently Used 
may be an acceptable strategy in this situation. This strategy may leave database 108 
populated with samples of those works which are most often performed over the most media 
streams being monitored. When a work is performed which is not in the active database 108 
35 then the performance may not be detected. 

To ensure that more works are identified, systems in the prior art may retain 
segments of media streams which are not identified in the active database 108 and attempt to 
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5 identify them using a second lookup module configured to use the entire available reference 
database to identify these segments. A multi-tier approach may be used with segments of the 
media streams that are unidentified being passed on to the next tier which is configured with 
a more comprehensive database. 

As noted earlier, media streams contain both pre-recorded works and 

10 extemporaneous performances. Segments of the media stream containing extemporaneous 
performances will not have corresponding entries in any reference database. Thus, all tiers of 
the system may need to attempt to identify all the segments of the media streams which 
contain extemporaneous performances. 

15 SUMMARY 

A method and apparatus are disclosed for managing the cache of a reference 
database. Aspects of disclosed systems may depend upon the identification of repeat 
performances to drive a cache promotion strategy. In one embodiment, a system may 
comprise at least one analysis module for receiving a work from a plurality of sources and 

20 generating a corresponding representation thereof; at least one First Tier identification (ID) 
server for identifying received works; and at least one Second Tier ID server for identifying 
repeating received works. Multi-tiered systems are also disclosed. 

BRIEF DESCRIPTION OF THE DRAWING FIGURES 
25 Figure 1 is a prior art diagram of a system. 

Figure 2 is a block diagram of one aspect of a disclosed system including one or 
more analysis modules and ID servers. 

Figure 3 is a block diagram of one aspect of an analysis module. 

Figure 4 is a flowchart of one aspect of a disclosed system. 
30 Figure 5 is a flowchart of one aspect of a disclosed system. 

Figure 6 is a flowchart of a further aspect of a disclosed system. 

Figure 7 is a flowchart of yet a farther aspect of a disclosed system. 

Figure 8 is a flowchart of multi-server aspect of a disclosed system. 

35 DETAILED DESCRIPTION 

Persons of ordinary skill in the art will realize that the following description is 
illustrative only and not in any way limiting. Other modifications and improvements will 
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5 readily suggest themselves to such skilled persons having the benefit of this disclosure. 

This disclosure may relate to data communications. Various disclosed aspects may be 
embodied in various computer and machine readable data structures. Furthermore, it is 
contemplated that data structures embodying the teachings of the disclosure may be 
transmitted across computer and machine readable media, and through communications 

10 systems by use of standard protocols such as those used to enable the Internet and other 
computer networking standards. 

The disclosure may relate to machine readable media on which are stored various 
aspects of the disclosure. It is contemplated that any media suitable for retrieving 
instructions is within the scope of the present disclosure. By way of example, such media 

15 may take the form of magnetic, optical, or semiconductor media. 

Various aspects of the disclosure may be described through the use of flowcharts. 
Often, a single instance of an aspect of the present disclosure may be shown. As is 
appreciated by those of ordinary skill in the art, however, the protocols, processes, and 
procedures described herein may be repeated continuously or as often as necessary to satisfy 

20 the needs described herein. Accordingly, the representation of various aspects of the present 
disclosure through the use of flowcharts should not be used to limit the scope of the present 
disclosure. 

Exemplary structure 

25 Figure 2 is a schematic diagram of one aspect of a disclosed playlist generation 

system 200. The system 200 may include at least one analysis module 202 for receiving a 
media stream from a broadcast source 204 as is known in the art. The analysis module may 
include one or more conventional antennae 214 coupled to the analysis module 202 through 
an input port 210. The input port 210 may include a conventional receiver for the reception 

30 of desired broadcast signals. The input port 210 may also be configured to provide remote 
control functionality for allowing the remote control and configuration of the receiver, such 
as providing for remote tuning. The input port 210 may be further configured to digitize 
received signals in digital formats using protocols known in the art, such as PCM. 

The analysis module 202 may also be configured to receive a media stream from one 

35 or more networked sources 206. In one aspect of a disclosed system, the input port 210 of 
the analysis module 202 may be configured to monitor sources providing content in standard 
formats such as Real®, QuickTime®, Windows Media®, MP3®, and similar formats, using 
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5 hardware and software as is known in the art. 

In another aspect of a disclosed system, the input port 210 may be configured to 
directly receive audio or video through any of the various means know in the art, such as a 
microphone, video acquisition system, VHS tape playback unit, or audio cassette playback 
unit. These media streams may also be provided in standard formats such as MPS, Windows 

10 Media, and similar formats. Thus, the analysis module 202 may be configured to receive a 
work prior to the work being presented to the broadcast system or network source. It is 
envisioned that this presentation could occur almost simultaneously. 

The input port 210 may be operatively coupled to a network 208 through which the 
source 206 may be accessed. The network 208 may comprise any packet- or frame-based 

15 network known in the art, such as the Internet. The input port 210 may also be configured to 
access the network 208 through any means known in the art, such as through traditional 
copper connections. Furthermore, the input port 210 may also be configured to access the 
network 208 using wireless connectivity methods as known in the art, including low-power 
broadband methods such as Bluetooth®, or cellular-based access methods such as those used 

20 to provide wireless connectivity to cellular phones and personal digital assistants (PDAs). 

The analysis module 202 may also include an output port 212 for providing 
connectivity to the network 208. The output port 212 may comprise a separate unit within 
the analysis module 202 and may include hardware and software to provide the same 
functionality as the input port 210. Additionally, it is contemplated that the output port 212 

25 may comprise substantially the same circuitry as the input port 210 in order to save space 
and cost. 

Referring now to Figure 3, a conceptual block diagram of one aspect of a disclosed 
analysis module 202 is shown. The analysis module 202 may include a processor 302 for 
operating various aspects of the present disclosure. The processor 302 may be operatively 
30 disposed between the input port 210 and output port 212. 

It is contemplated that any processor known in the art may be employed in the 
module 202, and the choice of a processor may depend on the application. For example, if 
the module 202 is embodied in a personal computer, the processor 202 may comprise a 
microprocessor capable of running conventional operating systems such as Microsoft 
35 Windows®, while if the module 202 is deployed in a mobile unit such as a PDA, the 
processor 202 may need only be capable of running an operating system such as Palm OS®, 
or other embedded systems such as may be present in a cell phone or other consumer device. 
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5 The module 202 may include ancillary hardware and software, such as conventional 

memory 304 and a conventional database 306 for the storage and retrieval of various aspects 
of the disclosed system and data. 

The module 202 may be configured to generate a representation of received work 
which may then be used by the system to identify performed works contained in the received 

10 work. It is contemplated that a wide variety of methods may be used by the analysis module 
202 to generate the representation. The analysis module may be configured to generate a 
representation of the received work using the psychoacoustic properties of the audio content 
of the received work. Such methods are known in the art. For example, the analysis module 
may generate feature vectors as disclosed in US Patent #5,918,223 to Blum, et al, which is 

15 assigned to the same assignee of the present disclosure and incorporated by reference as 
though fully set forth herein. 

Additionally, the module 202 may use audio or video spectral or wavelet 
representation techniques as are known in the art. For example, other representation forms 
may comprise the text output of a speech recognition system, text output of a close captioned 

20 transmission, or a musical score produced by a music transcription system. In another 
embodiment, the representation may comprise a bit calculated key using any of the 
techniques as are known in the art such as MD5 hash and CRC. It is contemplated that a 
wide variety of analysis methods may be employed singly or in combination advantageously 
in the present disclosure. 

25 Referring back to FIG. 2, the system 200 may further include at least one 

identification (ID) server 220 for identifying a received work. The ID server 220 may 
identify received works using a representation received from the analysis module 202 
through network 208. Though FIG. 2 shows the ID server 220 coupled to the same network 
208 as the analysis module 202, it is to be understood that the various components of the 

30 present disclosure may be coupled to different networks at different times. 

The ID server 220 may comprise a computer suitable for running an operating 
system such as Microsoft Windows®, UNIX®, LINUX®, MAC OS®, and the like. The ID 
server 220 may include a conventional processor 222 for operation of the server. The ID 
server may further include associated hardware and software known in the art such as a 

35 conventional database 224 for storing embodiments of the disclosure or data. 

It is contemplated that the ID server 220 may be configured to identify received work 
using a variety of methods known in the art. The method for identification may correspond 
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5 to the method(s) used to generate the representation within the analysis module. For 
example, the ID server 220 may be configured to perform identification using the methods 
disclosed in US Patent #5,918,223 to Blum, et al, if the representation were generated using 
corresponding methods. 

Another example would be the pure spectral representations as are known in the art. 

10 It is envisioned that other representations such as wavelets may be used. The invention 
could also identify the received work from the speech recognized text compared against a 
database of song lyrics using any of a variety of methods known to those skilled in the art. 

Yet another example would be any of a number of search techniques as are known in 
the art when the representation is a bit calculated key. 

15 The system may also identify the received work by searching a collection of musical 

works for musical note sequences that correspond to the musical score in the representation. 

In another configuration the system may use a combination of identification 
techniques, each of which correspond to a representation of the received work. By using 
several identification techniques, the chance of a misidentification or missed identification 

20 may be greatly reduced. 

Though the analysis module and ID server are shown as being located separately, it 
is contemplated that they also may be co-located in a single server. For example, it is 
contemplated that the analysis module and ID server may each be embodied in a single 
board computer wherein the analysis module and ID server are housed in a single unit and 

25 operatively coupled through a common backplane. 

Exemplary operation 

Figure 4 is a flowchart of one aspect of a disclosed method for automatically 
generating a playlist. The process begins in act 400, where at least one media stream is 
30 received by an analysis module. The analysis module may comprise hardware and software 
substantially as shown and described above. 

Additionally, one or more of the analysis modules may be configured to receive a 
plurality of stream sources simultaneously for analysis. It is contemplated that the analysis 
modules may be located and configured to receive and analyze a wide variety of content, 
35 including analog radio or video, digital streaming audio or video, VHS tape, audio cassette 
tape or any other media. 

In act 402, the analysis module then creates a representation of the received work as 
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5 shown and described above. The representation may be created by the analysis module by 
extracting psychoacoustic properties from the received work as described above. 

In act 404, the representations created by the one or more analysis modules may be 
provided to an ID server. The ID server may comprise hardware and software as described 
above. It is contemplated that the ID server may comprise a single server, multiple servers 
10 networked at a single location, or multiple servers located at different locations. 

It is contemplated that the various analysis modules may provide representations to 
one or more ID servers in a wide variety of manners. For example, all of the analysis 
modules present in a system may provide representations in real-time. Or, different analysis 
modules may be configured to provide representations at different intervals depending on the 
15 needs of the end user. The analysis modules may transmit representations every sixty 
seconds, hourly, or as often as is needed. 

In some cases where network connectivity is challenging, the representations may be 
batched up and sent to the ID server(s) once a day or less. In particularly harsh or secretive 
conditions, the representations may be stored within the analysis modules until the modules 
20 could be physically retrieved and operatively coupled to an ID server at another physical 
location. 

It is contemplated that an out-of-band event may be used to trigger the transmission 
of representations. For example, such a trigger may comprise the initialization of a 
connection to a network, or the activation of media playing software or hardware. 
25 Figure 5 is a flowchart of a further disclosed aspect of a disclosed method. The 

process begins with act 500, where an ID server receives at least one representation of a 
received work. The representations may be received from analysis modules as described 
above. 

In act 502, the ID server identifies portions of the received work based upon the 
30 representation. This identification may be performed using the methods as described above. 
The identification may include such information as the song title, artist, label, or any other 
information as is known in the art that may be associated with the work. The identification 
information might contain information such as the name of the advertiser or a descriptive 
notation of an FCC broadcaster identification segment. The identification information might 
35 contain a narrative description of a news segment. 

Once an identification of a received work is made, it is contemplated that a wide 
variety of further acts maybe performed. For example, the identifications made by the ID 
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5 server may be used to construct or maintain a playlist database. Such a playlist may be 
stored on the ID server, or on a distant server. As will be appreciated by those skilled in the 
art, if representations are provided to the ID server in real-time (or near real-time depending 
on the equipment or network used), a playlist may be generated in corresponding real-time. 
Thus, a playlist may be generated in real-time from inputs provided from distant geographic 
10 locations or multiple sources that contains a comprehensive playlist of every identified 
media segment. 

Additionally, the identification may be transmitted back to the analysis module 
which generated the representation. This may be advantageous where it is desired to 
generate a playlist for the particular analysis module's location or user. Thus, the ID server 

15 may be configured to provide an identification back to the source analysis module. 

The identity of the received work may also be used for the maintenance of the 
system. Typically, copies of received works are stored on local drives for audit purposes. 
Since the intermediate representation files may be larger in size than the identities, it may be 
desirable to configure the analysis module to purge intermediate representations for 

20 identified works to recover drive space. It is contemplated that the ID server may be 
configured to transmit the identity of received works back to the generating analysis module, 
and the corresponding part of the representation may then be deleted from local drives by the 
analysis module, thereby recovering valuable capacity. 

Furthermore, it is contemplated that the ID server or analysis module may be 

25 configured to send information regarding identified works to third parties, such as third- 
party servers. Additionally, the ID server or analysis module may be configured to provide 
an electronic notification to third parties of identifications made by the ID server. Examples 
of electronic notifications may include email, HTTP POST transactions, or other electronic 
communication as is known in the art. As is known by those skilled in the art, these 

30 electronic notifications may be used to initiate an action based on their content. For 
example, such notifications may allow the playlist to be accessed in real-time or as desired. 

It is contemplated that the ID server may be configured to provide customized 
play lists containing information tailored to a customer's individual needs. For example, a 
customer may wish to be notified whenever a certain work is broadcast, or whether a 

35 particular work is broadcast on a particular media outlet. Customers may wish to have 
complete playlists provided to them periodically at desired intervals that may include 
statistics known in the art. By using the system as disclosed herein, such requests may be 
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5 satisfied automatically in real-time, or at whatever interval may be desired. It is to be 
understood that any of the aspects of the present disclosure may be performed in real time or 
as often as desired. 

Unidentified segments 

10 During the process described above, the received work presented to the system may 

contain segments which may not be identified. In an aspect of a disclosed system, such 
unidentified segments may be examined to provide useful information. For example, if a 
particular unidentified segment is repeated often it may contain a new song or commercial or 
other pre-recorded work that warrants further action. 

15 In one aspect of a disclosed system, the ED server may examine the representations of 

unidentified segments, and determine that some sub-segments were actually repeat 
performances of a single work. Furthermore, this examination may extract a plurality of 
other characteristics of the original broadcast such as the amount of musical content, amount 
of speech content, a transcription based on speech recognition, the beat of any music present, 

20 etc. These characteristics of the unidentified segments may then be used to classify the 
unidentified received representations. 

For example, a sub-segment that has been performed more than once may be 
correlated with a high amount of musical content and a certain minimum length of play time 
to indicate that a new song has been detected. Correlating other values and characteristics 

25 could indicate that a new advertisement has been detected. In some cases a corresponding 
segment of the original broadcast signal could be retrieved and played for a human to 
perform an identification. 

Figure 6 is a flow diagram of a method for identifying new media content. The 
process of FIG. 6 may be performed by any of the structure disclosed in this disclosure. 

30 The process of FIG. 6 begins in act 600, where a media stream is received. The 

received media content may be delivered over any transmission medium as disclosed above. 
The process of FIG. 6 may receive content from one or more media streams. It is 
contemplated that the received work may comprise representations provided by an analysis 
module to an ID server as described above. In the discussion that follows, the terms 

35 segments and representations may be used interchangeably. Furthermore, when the 
following discussion discloses operations on a segment, that same operation may be 
performed on any part of the segment, or a sub segment, where the sub segment may overlap 
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5 other sub segments. 

In query 602, the system determines whether the received work can be identified. If 
the work can be identified, the work may be identified in act 604. The determination and 
identification acts may be performed as disclosed above. 

If the received work cannot be identified, then the unidentified segment may be 
10 reported to the system in act 606. It is contemplated that the unidentified segment may be 
indexed and cataloged. Additionally, a list of unidentified segments may be generated. 

Figure 7 is a flow diagram of a further aspect of identifying new media content. The 
process of FIG. 7 begins in query 700, where the system waits to receive an unidentified 
segment. 

15 In query 702, it is determined whether the received unidentified segment is similar to 

any part of any previously received unidentified segment. In one embodiment, the analysis 
performed in query 702 may comprise decomposing each unidentified segment into a series 
of overlapping 5-second sub segments and comparing each unidentified sub segment against 
other unidentified sub segments. It is contemplated that a wide variety of similarity 

20 measurement techniques may be used, such as those used to identify segments as disclosed 
above. For example, a threshold for similarity may comprise the vector distance between 
unidentified segments computed as disclosed above. The choice of similarity measurement 
may dictate the length of the matching sub segments discovered. 

If the unidentified segment is not determined to be similar to a previously received 

25 unidentified segment, then the segment may be indexed and cataloged in act 704. Such a 
segment may then serve as a reference against which future unidentified segments may be 
compared. 

If an unidentified segment is determined to be similar to a previously received 
unidentified segment, the system may conclude that similar unidentified segments may be 

30 performances of the same work, e.g., from the same master recording. When the similarity 
comparison process indicates that the unidentified sub segment is from the same work as 
another unidentified sub segment, then the system may attempt to extend the length of the 
similar unidentified segments by 'stitching' together contiguous unidentified sub segments 
which also meet the criteria of being performances of the same work. These extended 

35 segments consisting of similar earlier and later unidentified segments is referred to herein as 
a "super segment". 

Groups of super segments may be created which consist of contiguous runs of 
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5 unidentified segments collected from one or more media streams that may all be 
performances of the same work. It is contemplated that super segments may comprise any 
length, and may preferably have a length corresponding to standard media lengths such as 
15, 30 seconds, 60 seconds, 13 minutes, or even an hour. Of course, other lengths may be 
used. 

10 In a further exemplary embodiment, once a super segment has been created it will be 

included in the process of FIG. 7. Thus, newly received unidentified segments may be 
continuously added to a particular super segment if it is determined to be similar to any of 
the unidentified segments which are contiguous with another super segment in the same 
group of super segments. This action may extend the length of each super segment in the 

15 particular group of super segments. Thus, through the analysis of the present disclosure, a 
listing may be produced which includes the largest possible repeating segments across all 
time and across all the monitored media streams. 

These repeating segments may contain valuable information and may be reported on. 
In one embodiment, super segments may be reported on by length. For example, any 

20 repeating segments less than 63 seconds in length may represent advertisements, news 
segments, or station promotions. In another embodiment, any repeating segments between 2 
and 15 minutes may indicate a song. Additionally, longer repeating segments may indicate 
an entire broadcast is being repeated, such as a radio talk show or TV show. 

It is contemplated that the ID server as disclosed above may perform the process of 

25 FIG. 7. Once super segments are found, it is contemplated that a wide variety of further acts 
maybe performed. For example, the unidentified repeating segments found by the ID server 
may be used to construct or maintain a new works playlist database. Such a new works 
playlist may be stored on the ED server, or on a distant server. These repeating segments may 
be merged into a playlist report of identified media, thus making the playlist comprehensive 

30 of all master recordings. As will be appreciated by those skilled in the art, if representations 
are provided to the ID server in real-time (or near real-time depending on the equipment or 
network used), a new works playlist may be generated in corresponding real-time. Thus, a 
new works playlist may be generated in real-time from inputs provided from distant 
geographic locations or multiple sources that contains a comprehensive playlist of every 

35 unidentified media segment or super segment. 

Often a substantial time interval will pass between performances of a work over a 
given media stream. However, the same work is often performed on several different media 
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5 streams. The time between performances of the same work on different media streams may 
be far less than the time between performances of the work on any one media stream. 
Furthermore, advertisements may often play concurrently over several different media 
streams as the advertiser tries to achieve great consumer impact. Thus, the system described 
herein will preferably recognize a new work as soon as it is performed a second time over 

10 any monitored media stream. 

In a further aspect, the unidentified segments and super segments may be transmitted 
back to the analysis module which generated the representation. This may be advantageous 
where it is desired to generate a new work playlist for the particular analysis module's 
location or user. Thus, the ID server may be configured to provide unidentified segments or 

15 super segments back to the source analysis module. In this case, the source analysis module 
may decide to hold the original source audio corresponding to the new work super segment 
for future identification through more traditional, human based, methods. 

It is envisioned that the number of reference media samples may grow large on an ID 
Server, possibly resulting in system performance degradation. In certain aspects of the 

20 disclosed system, it may therefore be desirable to limit the size of the reference database. In 
one exemplary embodiment, an ID Server may use a Least Recently Used (LRU) strategy to 
remove from its reference database any reference samples for media works that have not 
been identified in a media stream in a predetermined amount of time. The LRU cache 
demotion strategy may be implemented as is well known to those skilled in the art. 

25 

Multi-tier server aspect 

To further improve the overall performance of the system, the workload for 
identifying works may be distributed over multiple servers. For example, a multi-tiered 
configuration may be deployed using a First Tier ID Server and a Second Tier ID Server, 

30 configured substantially as described above, but utilizing different reference databases. The 
First Tier ID Server may be configured to keep only a working subset of all reference 
samples in an active identification database. For example, the First Tier ID server may 
include a reference database of segments expected to be detected. This working subset may 
be comprised of both known reference samples and super segment groups. A larger 

35 reference set may be in use on a Second Tier ID Server that may include segments and super 
segment groups. In this configuration the First Tier ID Server may send representations of 
unidentified super segments to the Second Tier ID Server for an attempted identification 
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5 against the larger reference set. 

Figure 8 is a flowchart of a two-tiered identification aspect of a disclosed system. 
The process may begin in act 800 where it is determined whether a repeating work has been 
received. If an unidentified work has been received more than once, known as a super 
segment group, then the First Tier ID Server may send a super segment from the group to a 

10 Second Tier ID Server for identification in act 802. In the aspect of FIG. 8, the Second Tier 
ID Server only attempts to identify detected super segments. In other words, only repeating 
segments are sent to the Second Tier ID Server for an identification attempt. An 
unidentified work which has failed to be detected more than once is not sent to the Second 
Tier ID Server for identification. 

15 As will be appreciated by those skilled in the art, such a strategy can significantly 

reduce the load on the Second Tier ID Server by preventing extemporaneous works from 
being sent to the Second Tier ID Server for an attempt at identification, which is destined to 
fail. Essentially, extemporaneous performances may be culled from the representation of the 
media stream by the First Tier ID Server. 

20 If the Second Tier ID Server is successful in identifying the super segment, then the 

Second Tier ID Server may alert the First Tier ID Server to add the reference sample now 
known to correspond to the super segment into the First Tier ID Server reference database. 
In addition, the First Tier ID Server may also use the results of the Second Tier ID Server 
identification as if the First Tier ID Server had made the identification. It will be 

25 appreciated that by this act the number of unidentified segments sent to the Second Tier ID 
Server can be dramatically reduced through the optimization of the set of reference samples 
in use by the First Tier ID Server. Thus, the identification of a super segment by a Second 
Tier ID Server may drive the reference database cache promotion strategy. 

As will be appreciated by those skilled in the art, the culling and cache promotion 

30 strategies described above could be utilized in a system consisting of any number of tiers of 
ID Servers. For example, each successive tier (n+1) may have a larger database than the 
previous tier (n). Each tier may be composed of a number of servers with the same database 
all working in parallel. In this strategy, the system may attempt identification against all tiers 
at the same time, in parallel. When one tier reports identification, the system may abort the 

35 in-process identifications on the other tiers. 

Alternatively, more tiers may be deployed with successively smaller databases (n~l) 
until a desired price-performance point is reached. Additionally, as the size of the total 
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5 database grows, additional n+1 tiers may be deployed. 

Furthermore, it is contemplated that the ID server or analysis module may be 
configured to send information regarding detected new works to third parties, such as third- 
party servers. Additionally, the ID server or analysis module may be configured to provide 
an electronic notification to third parties of new work detection made by the ID server. 

10 Examples of electronic notifications may include email, HTTP POST transactions, or other 
electronic communication as is known in the art As is known by those skilled in the art, 
these electronic notifications may be used to initiate an action based on their content. For 
example, such notifications may allow the new works playlist to be accessed in real-time or 
as desired. The identification of a new work may be used to raise an alert that a new 

15 advertisement, song, or news clip has just been released to media casters. 

It is contemplated that the ID server may be configured to provide customized new 
works playlists containing information tailored to a customer's individual needs. For 
example, a customer may wish to be notified whenever a new work with certain 
characteristics, as described above, is detected, or whenever a particular type of new work is 

20 detected on a particular media outlet. For example, new works reports may be generated 
which classify unidentified segments based on length. Customers may wish to have 
complete new works playlists provided to them periodically at desired intervals that may 
include statistics known in the art. By using the system as disclosed herein, such requests 
may be satisfied automatically in real-time, or at whatever interval may be desired. It is to 

25 be understood that any of the aspects of the present disclosure may be performed in real time 
or as often as desired. 

While embodiments and applications have been shown and described, it would be 
apparent to those skilled in the art that many more modifications and improvements than 
mentioned above are possible without departing from the inventive concepts herein. The 
30 disclosure, therefore, is not to be restricted except in the spirit of the appended claims. 
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5 What is claimed is: 

1. A new media identification system comprising: 

at least one analysis module for receiving work from a plurality of sources and 

generating a corresponding representation thereof; 
at least one First Tier identification (ID) server for identifying received works; 
10 and 

at least one Second Tier ID server for identifying repeating received works. 

2. The system of claim 1 7 wherein said at least one analysis module further includes an 
input port configured to receive said received work from a networked source. 

3. The system of claim 1, wherein said at least one analysis module further includes an 
15 input port configured to receive said received work from a broadcast source. 

4. The system of claim 1, wherein said at least one analysis module further includes an 
input port configured to receive said received work in the form of a pre-broadcast 
digital form. 

5. The system of claim 1, wherein said at least one analysis module and said at least one 
20 ID server may be coupled over a network. 

6. The system of claim 5, wherein said network comprises the Internet. 

7. The system of claim 1, wherein said corresponding representation comprises feature 
vectors. 

8. The system of claim 1, wherein said corresponding representation comprises a 
25 spectral representation of said received work. 

9. The system of claim 1, wherein said corresponding representation comprises the text 
output of a speech recognition system. 

10. The system of claim 1, wherein said corresponding representation comprises the 
musical score output of a music transcription system. 

30 11. The system of claim 1, wherein said corresponding representation comprises a bit 
calculated key. 

12. The system of claim 1, wherein said First Tier ID server is configured to identify said 
received work using feature vectors. 

13. The system of claim 1, wherein said First Tier ID server is configured to identify said 
35 received work using a spectral representation of said received work. 

14. The system of claim 1, wherein said First Tier ID server is configured to identify said 
received work using the text output of a speech recognition system. 

17 



WO 03/067459 



PCT/US03/03505 



15. The system of claim 1, wherein said First Tier ID server is configured to identify said 
received work using the musical score output of a music transcription system. 

16. The system of claim 1, wherein said First Tier ID server is configured to identify said 
received work using a bit calculated key. 

17. The system of claim 1, wherein said First Tier ID server is configured to identify said 
received work using feature vectors. 

18. The system of claim 1, wherein said Second Tier ID server is configured to identify 
said received work using a spectral representation of said received work. 

19. The system of claim 1, wherein said Second Tier ID server is configured to identify 
said received work using the text output of a speech recognition system. 

20. The system of claim 1, wherein said Second Tier ID server is configured to identify 
said received work using the musical score output of a music transcription system. 

21. The system of claim 1, wherein said Second Tier ID server is configured to identify 
said received work using a bit calculated key. 

22. The system of claim 1, wherein said at least one analysis modules are further 
configured to receive a plurality of streaming sources for analysis at a single location. 

23. The system of claim 1, wherein said at least one analysis modules are further 
configured to receive a plurality of streaming sources for analysis at a plurality of 
different access points of the network. 

24. The system of claim 1, wherein said at least one analysis module is configured to 
provide said representations to said at least one First Tier ID server at a 
predetermined time interval. 

25. The system of claim 24, wherein said predetermined time interval comprises at least 
once a day. 

26. The system of claim 24, wherein said predetermined time interval comprises 
approximately once an hour. 

27. The system of claim 24, wherein said at least one analysis module is configured to 
provide said representations to said at least one First Tier ED server in approximately 
real time. 

28. The system of claim 24, wherein said at least one analysis module is configured to 
provide said representations to said at least one First Tier ID server based on an out- 
of-band event. 
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29. The system of claim 1, wherein said First Tier ID server is further configured to 
generate a playlist of identified works. 

30. A method for identifying new media comprising: 

receiving a segment of a media stream by a First Tier ID server; 

attempting, by said First Tier ID server, to identify said segment; 

if identification is not possible, then, determining whether said segment is similar 

to previously received unidentified segments; and 
if said segment is similar to a previously received unidentified segment, then 

sending said segment to a Second Tier ID server for identification. 

31. The method of claim 30, wherein said Second Tier ID server includes a plurality of 
tiers of ID servers. 

32. The method of claim 30, further including the act of providing a reference database 
of segments expected to be detected on said First Tier ID server. 

33. A system for managing the contents of an n-Tiered ID System comprising: 

a Tier N server including a database; 
at least one Tier N+l server; and 

wherein said Tier N server is configured to send repeating unidentified segments 
to said Tier N+l for identification. 

34. The system of claim 4, wherein said Tier N+l server is configured to notify said Tier 
N server of a repeating segment if a repeating segment is identified. 

35. The system of claim 33, wherein each successive Tier N+l server includes a database 
larger than the previous tier. 

36. The system of claim 35, wherein said successive tiers operate in parallel. 

37. The system of claim 36, wherein the operation of said successive tiers is aborted 
upon the identification of an unknown segment by a member of said successive tiers. 

38. The system of claim 33, further including a set of smaller Tier-1 ID servers having 
databases smaller than said Tier N server. 

39. A system for identifying new media comprising: 

means for receiving a segment of a media stream by a First Tier ID server; 
means for attempting, by said First Tier ID server, to identify said segment; 
means for determining whether said segment is similar to previously received 
unidentified segments if identification is not possible; and 
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means for sending said segment to a Second Tier ID server for identification if 
said segment is similar to a previously received unidentified segment. 

40. The system of claim 39, wherein said Second Tier ID server includes means for 
providing a plurality of tiers of ID servers. 

41. The method of claim 39, further including means for providing a reference database 
of segments expected to be detected on said First Tier ID server. 

42. A program storage device readable by a machine containing a set of instructions to 
perform a method by the machine, the method comprising: 

receiving a segment of a media stream by a First Tier ID server; 

attempting, by said First Tier ID server, to identify said segment; 

if identification is not possible, then, determining whether said segment is similar 

to previously received unidentified segments; and 
if said segment is similar to a previously received unidentified segment, then 

sending said segment to a Second Tier ID server for identification. 

43. The device of claim 42, wherein said Second Tier ID server includes a plurality of 
tiers of ID servers. 

44. The device of claim 42, further including the act of providing a reference database of 
segments expected to be detected on said First Tier ID server. 
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